Search CORE

61 research outputs found

gcodeml: A Grid-enabled Tool for Detecting Positive Selection in Biological Evolution

Author: Castella Briséïs
Kuzniar Arnold
Maffioletti Sergio
Moretti Sébastien
Murri Riccardo
Robinson-Rechavi Marc
Salamin Nicolas
Stockinger Heinz
Publication venue
Publication date: 01/01/2012
Field of study

One of the important questions in biological evolution is to know if certain changes along protein coding genes have contributed to the adaptation of species. This problem is known to be biologically complex and computationally very expensive. It, therefore, requires efficient Grid or cluster solutions to overcome the computational challenge. We have developed a Grid-enabled tool (gcodeml) that relies on the PAML (codeml) package to help analyse large phylogenetic datasets on both Grids and computational clusters. Although we report on results for gcodeml, our approach is applicable and customisable to related problems in biology or other scientific domains.Comment: 10 pages, 4 figures. To appear in the HealthGrid 2012 con

arXiv.org e-Print Archive

Serveur académique lausannois

ProGMap: an integrated annotation resource for protein orthology

Author: He Ying
Kuzniar Arnold
Leunissen Jack A. M.
Lin Ke
Nijveen Harm
Pongor Sándor
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Current protein sequence databases employ different classification schemes that often provide conflicting annotations, especially for poorly characterized proteins. ProGMap (Protein Group Mappings, http://www.bioinformatics.nl/progmap) is a web-tool designed to help researchers and database annotators to assess the coherence of protein groups defined in various databases and thereby facilitate the annotation of newly sequenced proteins. ProGMap is based on a non-redundant dataset of over 6.6 million protein sequences which is mapped to 240 000 protein group descriptions collected from UniProt, RefSeq, Ensembl, COG, KOG, OrthoMCL-DB, HomoloGene, TRIBES and PIRSF. ProGMap combines the underlying classification schemes via a network of links constructed by a fast and fully automated mapping approach originally developed for document classification. The web interface enables queries to be made using sequence identifiers, gene symbols, protein functions or amino acid and nucleotide sequences. For the latter query type BLAST similarity search and QuickMatch identity search services have been incorporated, for finding sequences similar (or identical) to a query sequence. ProGMap is meant to help users of high throughput methodologies who deal with partially annotated genomic data

PubMed Central

Wageningen University & Research Publications

Selectome update: quality control and computational improvements to a database of positive selection

Author: Castella Briséïs
Gharib Walid H.
Kuzniar Arnold
Laurenczy Balazs
Moretti Sébastien
Robinson-Rechavi Marc
Salamin Nicolas
Schabauer Hannes
Stockinger Heinz
Studer Romain A.
Valle Mario
Publication venue
Publication date: 02/08/2017
Field of study

Selectome (http://selectome.unil.ch/) is a database of positive selection, based on a branch-site likelihood test. This model estimates the number of nonsynonymous substitutions (dN) and synonymous substitutions (dS) to evaluate the variation in selective pressure (dN/dS ratio) over branches and over sites. Since the original release of Selectome, we have benchmarked and implemented a thorough quality control procedure on multiple sequence alignments, aiming to provide minimum false-positive results. We have also improved the computational efficiency of the branch-site test implementation, allowing larger data sets and more frequent updates. Release 6 of Selectome includes all gene trees from Ensembl for Primates and Glires, as well as a large set of vertebrate gene trees. A total of 6810 gene trees have some evidence of positive selection. Finally, the web interface has been improved to be more responsive and to facilitate searches and browsin

RERO DOC Digital Library

Interoperability and FAIRness through a novel combination of Web technologies

Author: Bolleman Jerven T.
Bonino da Silva Santos Luiz Olavo
Ciccarese Paolo
Clark Tim
Dumontier Michel
Gavai Anand
Gray Alasdair J. G.
Kaliyaperumal Rajaram
Kelpin Fleur D. L.
Kuzniar Arnold
Schultes Erik A.
Swertz Morris A.
Thompson Mark
van Mulligen Erik M.
Verborgh Ruben
Wilkinson Mark D.
Publication venue: 'PeerJ'
Publication date: 01/01/2017
Field of study

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT). These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not. The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability. Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings. We show that by using off-the-shelf technologies, interoperability can be achieved atthe level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles. The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs

Maastricht University Research Portal

Heriot Watt Pure

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Ghent University Academic Bibliography

Directory of Open Access Journals

Dissertations of the University of Groningen

Semi-quantitative proteomics of mammalian cells upon short-term exposure to nonionizing electromagnetic fields

Author: Bezstarosti K. (Karel)
Dekkers D.H. (Dick)
Demmers J.A.A. (Jeroen)
Eppink B. (Berina)
Kanaar R. (Roland)
Kuzniar A. (Arnold)
Laffeber C.
Lebbink J.H.G. (Joyce)
Woelders H. (Henri)
Zwamborn A.P.M. (Adrianus)
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/02/2017
Field of study

The potential effects of non-ionizing electromagnetic fields (EMFs), such as those emitted by power-lines (in extremely low frequency range), mobile cellular systems and wireless networking devices (in radio frequency range) on human health have been intensively researched and debated. However, how exposure to these EMFs may lead to biological changes underlying possible health effects is still unclear. To reveal EMF-induced molecular changes, unbiased experiments (without a priori focusing on specific biological processes) with sensitive readouts are required. We present the first proteome-wide semi-quantitative mass spectrometry analysis of human fibroblasts, osteosarcomas and mouse embryonic stem cells exposed to three types of non-ionizing EMFs (ELF 50 Hz, UMTS 2.1 GHz and WiFi 5.8 GHz). We performed controlled in vitro EMF exposures of metabolically labeled mammalian cells followed by reliable statistical analyses of differential protein-and pathway-level regulations using an array of established bioinformatics methods. Our results indicate that less than 1% of the quantitated human or mouse proteome responds to the EMFs by small changes in protein abundance. Further network-based analysis of the differentially regulated proteins did not detect significantly perturbed cellular processes or pathways in human and mouse cells in response to ELF, UMTS or WiFi exposure. In conclusion, our extensive bioinformatics analyses of semi-quantitative mass spectrometry data do not support the notion that the short-time exposures to non-ionizing EMFs have a consistent biologically significant bearing on mammalian cells in culture

Erasmus University Digital Repository

eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges

Author: A. Roth
Altenhoff
C. von Mering
Chen
Chen
Ciccarelli
Creevey
D. Szklarczyk
Eisen
Gabaldon
Hulsen
I. Letunic
J. Muller
K. Trachana
Koonin
Kuzniar
L. J. Jensen
Linard
M. Kuhn
Makarova
Milinkovitch
P. Bork
Pearson
R. Arnold
S. Powell
T. Doerks
T. Rattei
Tatusov
Tatusov
Trachana
van der Heijden
von Mering
Wapinski
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Orthologous relationships form the basis of most comparative genomic and metagenomic studies and are essential for proper phylogenetic and functional analyses. The third version of the eggNOG database (http://eggnog.embl.de) contains non-supervised orthologous groups constructed from 1133 organisms, doubling the number of genes with orthology assignment compared to eggNOG v2. The new release is the result of a number of improvements and expansions: (i) the underlying homology searches are now based on the SIMAP database; (ii) the orthologous groups have been extended to 41 levels of selected taxonomic ranges enabling much more fine-grained orthology assignments; and (iii) the newly designed web page is considerably faster with more functionality. In total, eggNOG v3 contains 721 801 orthologous groups, encompassing a total of 4 396 591 genes. Additionally, we updated 4873 and 4850 original COGs and KOGs, respectively, to include all 1133 organisms. At the universal level, covering all three domains of life, 101 208 orthologous groups are available, while the others are applicable at 40 more limited taxonomic ranges. Each group is amended by multiple sequence alignments and maximum-likelihood trees and broad functional descriptions are provided for 450 904 orthologous groups (62.5%)

Crossref

University of Birmingham Research Portal

PubMed Central

Copenhagen University Research Information System

ZORA

MDC Repository

Finished Genome of the Fungal Wheat Pathogen Mycosphaerella graminicola Reveals Dispensome Structure, Chromosome Plasticity, and Stealth Pathogenesis.

Author: Aerts Andrea
Antoniw John
Bailey Andy
Bluhm Burt
Bowler Judith
Bristow Jim
Canto-Canché Blondy
Churchill Alice CL
Conde-Ferràez Laura
Cools Hans J
Coutinho Pedro M
Crane Charles F
Csukai Michael
de Vries Ronald P
De Wit Pierre
Dehal Paramvir
Dhillon Braham
Donzelli Bruno
Foster Andrew J
Goodwin Stephen B
Grigoriev Igor V.
Grimwood Jane
Hammond-Kosack Kim E
Hane James K
Henrissat Bernard
Kema Gert HJ
Kilan Andrzej
Kobayashi Adilson K
Koopmann Edda
Kourmpetis Yiannis
Kuzniar Arnold
Lindquist Erika
Lombard Vincent
M\u27Barek Sarrah Ben
Maliepaard Chris
Martins Natalia
Mehrabi Rahim
Nap Jan PH
Oliver Richard P
Ponomarenko Alisa
Rudd Jason J
Salamov Asaf
Schmutz Jeremy
Schouten Henk J
Shapiro Harris
Stergiopoulos Ioannis
Torriani Stefano FF
Tu Hank
van de Geest Henri C
van der Burgt Ate
Van der Lee Theo AJ
van Ham Roeland CHJ
Waalwijk Cees
Ware Sara B
Wiebenga Ad
Wittenberg Alexander HJ
Zwiers Lute-Harm
Publication venue: Purdue University
Publication date: 01/01/2011
Field of study

The plant-pathogenic fungus Mycosphaerella graminicola (asexual stage: Septoria tritici) causes septoria tritici blotch, a disease that greatly reduces the yield and quality of wheat. This disease is economically important in most wheat-growing areas worldwide and threatens global food production. Control of the disease has been hampered by a limited understanding of the genetic and biochemical bases of pathogenicity, including mechanisms of infection and of resistance in the host. Unlike most other plant pathogens, M. graminicola has a long latent period during which it evades host defenses. Although this type of stealth pathogenicity occurs commonly in Mycosphaerella and other Dothideomycetes, the largest class of plant-pathogenic fungi, its genetic basis is not known. To address this problem, the genome of M. graminicolawas sequenced completely. The finished genome contains 21 chromosomes, eight of which could be lost with no visible effect on the fungus and thus are dispensable. This eight-chromosome dispensome is dynamic in field and progeny isolates, is different from the core genome in gene and repeat content, and appears to have originated by ancient horizontal transfer from an unknown donor. Synteny plots of the M. graminicola chromosomes versus those of the only other sequenced Dothideomycete, Stagonospora nodorum, revealed conservation of gene content but not order or orientation, suggesting a high rate of intra-chromosomal rearrangement in one or both species. This observed “mesosynteny” is very different from synteny seen between other organisms. A surprising feature of the M. graminicolagenome compared to other sequenced plant pathogens was that it contained very few genes for enzymes that break down plant cell walls, which was more similar to endophytes than to pathogens. The stealth pathogenesis of M. graminicola probably involves degradation of proteins rather than carbohydrates to evade host defenses during the biotrophic stage of infection and may have evolved from endophytic ancestors

Repository for Publications and Research Data

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Wageningen University & Research Publications

Purdue E-Pubs

espace@Curtin

Explore Bristol Research

pbg-ld

Author: Kuzniar Arnold
Publication venue: Zenodo
Publication date: 23/02/2023
Field of study

Linked Data Platform for Plant Breeding & Genomics.</p

PIQMIe

Author: Kuzniar Arnold
Publication venue: Zenodo
Publication date: 21/11/2015
Field of study

PIQMIe is a web-based tool for reliable analysis and visualization of semi-quantitative mass spectrometry (MS)-based proteomics data. This tool readily integrates peptide and (non-redundant) protein identifications and quantitations, as obtained by the MaxQuant/Andromeda software with additional biological information from the UniProtKB database, and makes the linked data available in the form of a light-weight relational database (SQLite). Using the web interface, users are presented with a concise summary of their proteomics experiments in numerical and graphical forms, as well as with a searchable protein grid and interactive visualization tools to aid in the rapid assessment of the experiments and in the identification of proteins of interest. PIQMIe provides data access via a web interface and programmatic RESTful API.</p